Video feature extraction method and device

ABSTRACT

A video feature extraction method and device are provided. The method includes extracting one or more frames from a video object to obtain one or more frames of images, obtaining one or more shift vectors for each of the one or more frames of images, using each of the one or more shift vectors, taking any pixel in each of the one or more frames of images as a starting point, determining a terminal point directed by the one or more shift vectors from the starting point, and determining an image feature of the one or more frames of images according to a difference between the starting point and the terminal point, and determining a video feature of the video object according to the image feature of the one or more frames of images.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of priority to Chinese PatentApplication with the application number 201810271773.1, filed Mar. 29,2018. The entire content of this application is hereby incorporated byreference herein.

TECHNICAL FIELD

The disclosure relates to the technical field of video processing, andmore particularly to a video feature extraction method and device.

BACKGROUND ART

In the current multimedia information society, users upload massivevideos to a video platform every day. Most of these videos are normaland valuable videos. However, there are also some defective videos. Thedefective videos mainly include: videos repeated with existing videos ina video database of the platform, videos repeated with videos in acopyright database (such as videos needing royalties) and some videosnot suitable for being displayed or forbidden to be displayed.Therefore, it is necessary to quickly compare and filter the massivevideos uploaded by the users. The core technology to improve the speedand accuracy of video comparison is to reasonably extract and comparefeatures of video frames and determine similarities.

In order to improve the speed and accuracy of comparison, it isnecessary to generate, for one video, a video feature that cancharacterize the video, so as to determine the similarity of the videosby comparing the video features of two videos. An extraction method ofthe video features and the excellence degree of the video featuresdetermine the efficiency and accuracy of the video comparison.

SUMMARY

The object of the disclosure is to provide a new video featureextraction method and device.

The purpose of the disclosure is realized by the following technicalsolutions. The video feature extraction method according to thedisclosure includes the following steps: extracting one or more framesfrom a video object to obtain one or more frames of images; using eachof the one or more shift vectors, taking any pixel in each of the one ormore frames of images as a starting point, determining a terminal pointdirected by the one or more shift vectors from the starting point, anddetermining an image feature of the one or more frames of imagesaccording to a difference between the starting point and the terminalpoint; and determining a video feature of the video object according tothe image features of the one or more frames of images.

The object of the disclosure can be further achieved by the followingtechnical scheme.

In the forgoing video feature extraction method, a length and adirection of the one or more shift vectors are arbitrary.

In the forgoing video feature extraction method, determining an imagefeature of the one or more frames of images according to a differencebetween the starting point and the terminal point includes thatdetermining the difference between each starting point and acorresponding terminal point in one of the one or more frames of imagesfor each of the one or more shift vectors, and determining a feature bitof the one or more frames of images based on an identical shift vectorof the one or more shift vectors according to an overall situation ofeach difference based on the identical shift vector so as to obtain thefeature bit of each of the one or more shift vectors; and determiningthe image feature of the one of the one or more frames of imagesaccording to the feature bit of the one of the one or more frames ofimages based on the one or more shift vectors.

In the forgoing video feature extraction method, obtaining the one ormore shift vectors further includes distributing a counter to each ofthe one or more shift vectors; determining a feature bit of the one ofthe one or more frames of images based on an identical shift vector ofthe one or more shift vectors according to an overall situation of eachdifference based on the identical shift vector includes increasing orreducing a value of the counter corresponding to the one or more shiftvectors according to the each difference of the one or more shiftvectors, and comparing the value of the counter with a predetermined setvalue, and determining the feature bit of the one of the one or moreframes of images based on the one or more shift vectors according to thecomparison result.

In the forgoing video feature extraction method, the difference betweenthe starting point and the terminal point includes a difference betweenbrightness information of the starting point and brightness informationof the terminal point.

The purpose of the disclosure is also realized by using the followingtechnical solutions. The video feature library construction methodincludes the following steps: extracting a video feature of a videoobject according to any one of the foregoing video feature extractionmethod, and storing the video feature into a video feature library.

The purpose of the disclosure is also realized by using the followingtechnical solutions. A video feature extraction device according to thedisclosure includes a frame extractor configured for extracting one ormore frames from a video object to obtain one or more frames of images,an image feature determiner configured for obtaining one or more shiftvectors for each of the one or more frames of images, using each of theone or more shift vectors, taking any pixel in each of the one or moreframes of images as a starting point, determining a terminal pointdirected by the one or more shift vectors from the starting point, anddetermining an image feature of the one or more frames of imagesaccording to a difference between the starting point and the terminalpoint; and a video feature determiner configured for determining a videofeature of the video object according to the image feature of the one ormore frames of images.

The purpose of the disclosure can be further realized by using thefollowing technical scheme.

The video feature extraction device further includes a module forexecuting the step of any video feature extraction methodaforementioned.

The object of the disclosure is also realized by the following technicalsolutions. An audio fingerprint library construction device according tothe disclosure, includes a video feature extractor configured forextracting video features of video objects according to any one theforegoing video feature extraction method; a video feature memoryconfigured for storing the video features into a video feature library;and the video feature library configured for storing the video features.

The object of the disclosure is also realized by the following technicalsolutions. A video feature extraction hardware device according to thedisclosure includes a memory configured for storing non-transitorycomputer readable instructions; and a processor, configured forexecuting the computer readable instructions such that the processorperforms any of the foregoing video feature extraction methods.

The object of the disclosure is also realized by the following technicalsolutions. A computer readable storage medium according to thedisclosure configured for storing non-transitory computer readableinstructions, when the non-transitory computer readable instructions areexecuted by a computer, causes the computer to perform any of theaforementioned video feature extraction methods.

The object of the disclosure is also realized by the following technicalsolutions. A terminal device according to the disclosure includes any ofthe foregoing video feature extraction devices.

The above description is only an overview of the technical solution ofthe disclosure, in order to more clearly understand the technical meansof the disclosure, it can be implemented in accordance with the contentsof the specification, the above and other objects, features andadvantages of the disclosure will become more apparent and understood,the following is a detailed description of the preferred embodiment, andwith reference to the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a flow block diagram of a video feature extraction methodaccording to one embodiment of the disclosure.

FIG. 2 is a flowchart of the video feature extraction method provided byone embodiment of the disclosure.

FIG. 3 is a flow block diagram of the process of determining the imagefeatures of a frame of image provided by one embodiment of thedisclosure.

FIG. 4 is a flow block diagram of a video feature library constructionmethod according to one embodiment of the disclosure.

FIG. 5 is a structural block diagram of a video feature extractiondevice according to one embodiment of the disclosure.

FIG. 6 is a structural block diagram of an image feature determineraccording to one embodiment of the disclosure.

FIG. 7 is a structural block diagram of a video feature libraryconstruction device according to one embodiment of the disclosure.

FIG. 8 is a hardware block diagram of a video feature extractionhardware device according to one embodiment of the disclosure.

FIG. 9 is a schematic diagram of a computer readable storage mediumaccording to one embodiment of the disclosure.

FIG. 10 is a structural block diagram of a terminal equipment accordingto one embodiment of the disclosure.

DETAILED DESCRIPTION

In order to further expound the technical means and efficacy taken bythe disclosure for achieving the purpose of the disclosure, the detaileddescription of the video feature extraction method and device accordingto the disclosure, the structure, the features and the effects thereofare described in detail in conjunction with the accompanying drawingsand the preferred embodiments.

FIG. 1 is a schematic block diagram of a video feature extraction methodaccording to an embodiment of the disclosure. FIG. 2 is a schematic flowdiagram of a video feature extraction method according to an embodimentof the disclosure. Referring to FIG. 1 and FIG. 2, a video featureextraction method according to an example of the disclosure mainlyincludes the following steps.

Step S11, one or more frames are extracted from a video object (forexample, a section of video signals or a video file) to obtain one ormore frames of images. The specific number of the extracted frames ofimages may be set/disposed; for example, two frames of images can beextracted from the video in each second, or one frame of image can beextracted from the video in each second. It is noted that the frameextraction can be performed uniformly; that is, the time intervalbetween two adjacent frames of images is consistent. Thereafter, theprocess proceeds to step S12.

Step S12, for each frame of image, one or more shift vectors areobtained, and the number of the obtained shift vectors is assumed to beN, where N is a positive integer. Each shift vector is used, with anypixel in the frame of image as a starting point, to determine a pixel ata terminal point from the starting point directed by the shift vector.The image feature of the frame of image can be determined according tothe difference between the starting point and the corresponding terminalpoint. Thereafter, the process proceeds to step S13.

Specifically, the N shift vectors can be preset or randomly generated.The obtained shift vector is shown as 001 in FIG. 2. Further, the lengthand direction of each shift vector are arbitrary. In addition, the shiftvectors are mutually independent and do not need any correlation. It isnoted that, for the frames of images obtained by extracting frames, theimage feature of each frame of image can be generally determined byusing the same group of shift vectors; however, different groups ofshift vectors can likewise be used for respectively determining theimage features of each frame image.

It should be noted that, by taking any pixel as the starting point, allpixels in the frame of image can be defined as the starting points, orthe pixels in one or more preset positions in the frame of image can bedefined as the starting points, and the specific position can bearbitrary; for example, all points not located on the edge in one frameof image can be taken as starting points.

In a specific example of the disclosure, in the process of determiningan image feature of a frame of image, using each shift vector, eachpixel in the frame of image is taken as a starting point; the differencebetween each pixel at the starting point and the corresponding pixel atthe terminal point is traversed; the feature bit of the frame of imagebased on the shift vector is determined according to the overallsituation of each difference condition based on the same shift vector soas to obtain the feature bit of each shift vector. In one example, thedifference conditions based on the same shift vector can be summed todetermine the overall condition of each difference condition. Then,according to the feature bit corresponding to the obtained N shiftvectors, an image feature (which may also be referred to as a videofeature singleton) whose length is N corresponding to the frame of imageis determined. The obtained image feature whose length is N is shown as004 in FIG. 2.

In some embodiments of the disclosure, the difference between thestarting point and the terminal point includes the difference betweenthe brightness information of the pixel at the starting point and thebrightness information of the pixel at the terminal point.

Step S13, a video feature of the video object is determined according toimage features of each frame of image corresponding to the video object.Specifically, the image features can be arranged according to the timesequence of the corresponding frames of images to obtain an imagefeature sequence, and the image feature sequence is determined as avideo feature.

In the video feature extraction method in the disclosure, the pixel atthe starting point/starting point pixel and the pixel at the terminalpoint/terminal point pixel in the frame the image extracted from thevideo object are determined according to the shift vector with randomlength and direction, and the difference situation between the startingpoint pixel and the terminal point pixel is compared to generate thevideo feature, so that the accuracy and the extraction efficiency of thevideo feature extraction can be improved, and the performance of theobtained video feature can be improved. As a result, the videocomparison, video retrieval, video deduplication, and video contentmonitoring based on the video features obtained by the video featureextraction method of the disclosure can acquire higher accuracy andefficiency.

FIG. 3 is a schematic flow diagram of determining an image feature of aframe of image provided by an embodiment of a video feature extractionmethod according to the disclosure. Referring to FIG. 2 and FIG. 3, inan embodiment of the disclosure, the specific process of determining oneimage feature of a frame of image by using N shift vectors in step S12includes following steps.

Step S21, each of the N shift vectors is assigned a counter. Thereafter,the process proceeds to step S22.

Step S22, according to a shift vector, with respect to any pixel in aframe of image, the pixel is taken as a starting point to determine thepixel at the terminal point directed by the shift vector can bedetermined, and the brightness value of the starting point and that ofthe terminal point are obtained so as to determine the differencecondition of the brightness values between the starting point and theterminal point; for example, the difference conditions contain the twoconditions of the starting point brighter than the terminal point andthe terminal point brighter than the terminal point, and the value ofthe counter corresponding to the shift vector is increased or reducedaccording to the difference condition of each starting point based onthe shift vector. Thereafter, the process proceeds to step S23.

Specifically, as 002 shown in FIG. 2, in one example, any pixel of theframe of image serves as a starting point, according to a shift vector(the rightmost shift vector in 001 of FIG. 2 is taken), the terminalpoint directed by each starting point can be determined. The brightnessvalue of the starting point and the brightness value of thecorresponding terminal point are compared, and if the brightness valueof the starting point is larger than the brightness value of theterminal point (as shown in FIG. 2, the starting point is lighter thanthe terminal point), the value of the counter is +1; otherwise, if thebrightness value of the starting point is smaller than the brightnessvalue of the terminal point (as the starting point is darker than theterminal point shown in FIG. 2), the value of the counter is −1. Theconditions of all the pixels are counted, and the value of the counteris increased or decreased, and the obtained counter is shown as 003 inFIG. 2.

It should be noted that if the terminal point of the shift vectorexceeds the range of the frame of image, the value of the counter can beunchanged, or the frame of image can be periodically extended; the frameof image is surrounded by identical images to ensure a correspondingpixel existing at the terminal point of the shift vector.

Step S23, the value of the counter and the preset value are compared toobtain a comparison result, and the feature bit based on the shiftvector of the frame of image can be determined according to thecomparison result. Thereafter, the process proceeds to step S24.

Specifically, the set value can be preset to be 0; whether the value ofthe counter is larger than a predetermined set value or not is judged,if the value of the counter is larger than the set value, a feature bitwhose value is 1 is generated; otherwise, a feature bit whose value is 0is generated. As 003 and 004 shown in FIG. 2, if the value of thecounter is +2, a feature bit whose value is 1 is correspondinglygenerated.

Step S24, the image features of the frame of image can be determinedaccording to the feature bits based on the N shift vectors of one frameof image. The obtained image feature is shown as 004 in FIG. 2.

It should be noted that the change rule of the increase and decrease ofthe counter in step S22 is not limited to the manner in the foregoingembodiment; for example, the value of the counter can +1 when thebrightness value of the starting point is smaller than the brightnessvalue of terminal point; otherwise, the value of the counter is −1. Inaddition, for the case when the brightness values of the start point andthe terminal point are equal, it can uniformly set the value of thecounter +1, or can uniformly set the value of the counter −1, or canuniformly set the value of the counter unchanged. Similarly, for stepS23, the value of the feature bit can be determined in other manners;for example, according to the counter after changing the value in stepS22, if the value of the counter is smaller than 0, a feature bit whosevalue is 1 is generated; otherwise, a feature bit whose value is 0 isgenerated.

FIG. 4 is a schematic flowchart of an embodiment of a video featurelibrary construction method according to the disclosure. Referring toFIG. 4, a method for constructing a video feature library according toan example of the disclosure mainly includes the following steps.

Step S31, a video feature of a video object is extracted according tothe steps of the foregoing video feature extraction method exemplifiedin the disclosure. Then, the process will be advanced to step S32.

Step S32, the video feature of the video object is stored into a videofeature library.

It should be noted that the video features in one video feature libraryshould be obtained according to the same feature extraction method. Thatis, in the process of video feature extraction in step S31, frameselection is performed based on the same method in step S11, and thesame shift vector is obtained and the image feature is determined basedon the same method in step S12, and the video features are determinedbased on the same method in step S13. In addition, as time goes on, thevideo feature library may be updated at any time.

FIG. 5 is a schematic structural block diagram of an embodiment of avideo feature extraction apparatus according to the disclosure.Referring to FIG. 5, the video feature extraction apparatus/device 100of the disclosure mainly includes a frame extractor 110 configured forcarrying out frame extraction on a video object to obtain one or moreframes of images, an image feature determiner 120 configured forobtaining one or more shift vectors for each frame of image, using eachshift vector, taking any pixel in the frame of image as a startingpoint, determining a pixel at a terminal point directed by the shiftvector from the starting point, and determining an image feature of theframe of image according to the difference between the starting pointand the terminal point, a video feature determiner 130 configured fordetermining video features of the video object according to imagefeatures of each frame of image corresponding to the video object.

FIG. 6 is a schematic structural block diagram of an image featuredeterminer 120 provided by one embodiment of a video feature extractionapparatus according to the disclosure. Referring to FIG. 6, in oneembodiment of the disclosure, the image feature determiner 120specifically includes a counter generating unit/counter generator 121configured for allocating a counter for each of the N shift vectors, acounting unit/counter 122 configured for determining a pixel at aterminal point directed by a shift vector by taking any pixel in a frameof image as a starting point, and obtaining the brightness value of thestarting point and that of the terminal point so as to determine thedifference condition of the brightness values between the starting pointand the terminal point, and increasing or reducing the value of thecounter corresponding to the shift vector according to the differencecondition of each starting point based on the shift vector, a featurebit determination unit/determiner 123 configured for comparing the valueof the counter with a predetermined set value to obtain a comparisonresult, and determining the feature bit based on the shift vector of theframe of image according to the comparison result, an image featuredetermination unit/determiner 124 configured for determining imagefeatures of the frame of image according to the feature bits based onthe N shift vectors of one frame of image.

FIG. 7 is a schematic structural diagram of one embodiment of a videofeature library construction device of the disclosure. Referring to FIG.7, the video feature library construction device 200 exemplified in thedisclosure mainly includes a video feature extractor 201 configured forextracting video features of video objects according to the steps of theforegoing video feature extraction method of the disclosure a videofeature memory 202 configured for storing the video features into avideo feature library, and a video feature library 203 configured forstoring the video features of the various video objects.

The video feature extractor 201 includes the frame extractor 110, theimage feature determiner 120 and the video feature determiner 130 of theforegoing video feature extraction device exemplified in the disclosure.

FIG. 8 is a hardware block diagram illustrating a video featureextraction hardware device according to an embodiment of the disclosure.As shown in FIG. 8, a video feature extraction hardware device 300comprises a memory 301 and a processor 302. The components in videofeature extraction hardware device 300 are interconnected by a bussystem and/or other form of connection mechanism (not shown).

The memory 301 is configured for storing non-transitory computerreadable instructions. In particular, the memory 301 may include one ormore computer program products that may comprise various forms ofcomputer readable storage media such as volatile memory and/ornonvolatile memory. The volatile memory may include, for example, randomaccess memory (RAM) and/or cache memory (cache) and the like. Thenonvolatile memory may include, for example, a read-only memory (ROM), ahard disk, a flash memory, etc.

The processor 302 may be a central processing unit (CPU) or other formof processing unit with data processing capabilities and/or instructionexecution capabilities, and may control the other components in thevideo feature extraction hardware device 300 to perform the desiredfunctions. In one embodiment of the disclosure, the processor 302 isconfigured for running the computer readable instructions stored in thememory 301 such that the media features are all or a part of the stepsof the method of the video feature extraction of the embodiments of thedisclosure described above with respect to the hardware device 300.

FIG. 9 is a schematic diagram illustrating a computer readable storagemedium according to an embodiment of the disclosure. As shown in FIG. 9,according to the computer readable storage medium 400 according to theembodiment of the disclosure, a non-transitory computer readableinstruction 401 is stored thereon; when the non-transitory computerreadable instruction 401 is executed by a processor, all or a part ofthe steps of the method for the video feature extraction of variousembodiments of the disclosure are executed.

FIG. 10 is a schematic diagram illustrating a hardware structure of aterminal device according to an embodiment of the disclosure. A terminaldevice may be implemented in a variety of forms. The terminal devices inthe disclosure may include, but are not limited to, mobile terminaldevices such as mobile phones, smart phones, notebook computers, digitalbroadcast receivers, PDA (personal digital assistants), PAD (tabletcomputers), PMP (portable multimedia players), navigation devices,vehicle-mounted terminal devices, vehicle-mounted display terminals,vehicle-mounted electronic rearview mirrors, and the like, and fixedterminal devices such as digital TV, desktop computers, and the like.

As shown in FIG. 10, the terminal device 1100 may include a wirelesscommunication unit 1110, an A/V (audio/video) input unit 1120, an userinput unit 1130, a sensing unit 1140, an output unit 1150, a memory1160, an interface unit 1170, a controller 1180, and a power unit 1190.FIG. 10 illustrates a terminal device with various components, but itshould be understood that all illustrated components are not required tobe implemented. More or fewer components may be implemented instead.

The wireless communication unit 1110 enables radio communication betweenthe terminal device 1100 and a wireless communication system or network.The A/V input unit 1120 is configured to receive audio or video signals.The user input unit 1130 may generate key input data according to acommand input by the user to control various operations of the terminaldevice. The sensing unit 1140 detects the current state of the terminaldevice 1100, the position of the terminal device 1100, the orientationof the user on the touch input of the terminal device 1100, theorientation of the terminal device 1100, the acceleration ordeceleration movement of the terminal device 1100, and the like, andgenerates a command or signal for controlling the operation of theterminal device 1100. The interface unit 1170 serves as an interface forconnecting the at least one external device with the terminal device1100. The output unit 1150 is configured to provide an output signal ina visual, audio and/or haptic manner. The memory 1160 may store softwareprograms, such as processing and control operations, executed by thecontroller 1180, or may temporarily store data to be output or output tobe output. The memory 1160 may include at least one type of storagemedium. Moreover, the terminal device 1100 can cooperate with a networkstorage device that performs storage functions of the memory 1160through a network connection. The controller 1180 typically controls theoverall operation of the terminal device. In addition, the controller1180 may include a multimedia module for reproduction or playback ofmultimedia data. The controller 1180 may perform pattern recognitionprocessing to identify handwritten input or image draw inputs that areperformed on the touch screen as characters or images. The power supplyunit 1190 receives external power or internal power under the control ofthe controller 1180 and provides appropriate electrical power requiredto operate the elements and components.

The various embodiments of the video feature extraction method of thedisclosure may be implemented using a computer readable medium such ascomputer software, hardware, or any combination thereof. For a hardwareimplementation, various embodiments of the video feature extractionmethod presented herein may be implemented by using at least one of anapplication specific integrated circuit (ASIC), a digital signalprocessor (DSP), a digital signal processing device (DSPD), aprogrammable logic device (PLD), a field programmable gate array (FPGA),a processor, a controller, a microcontroller, a microprocessor, and anelectronic unit designed to perform the functions described herein. Thevarious embodiments of the video feature extraction method proposed bythe disclosure can be implemented in the controller 1180. For a softwareimplementation, various embodiments of the video feature extractionmethod proposed by the disclosure may be implemented with separatesoftware modules that allow at least one function or operation to beperformed. The software codes can be implemented by softwareapplications (or programs) written in any suitable programming language,which can be stored in the memory 1160 and executed by the controller1180.

Above, a video feature extraction method, apparatus, hardware device,computer readable storage medium, and terminal device according to anembodiment of the disclosure, the multiple types of pooling areperformed stage by stage on the frame images that are obtained by thevideo frame selection to generate the video feature, so that theaccuracy of video feature extraction and the extraction efficiency maybe greatly improved, and the excellence degree and the robustness of theobtained video feature may be increased. Further, the video featureobtained by the video feature extraction method of the disclosure hashigher accuracy, higher efficiency, and better robustness for videocomparison, video retrieval, video deduplication, and video contentmonitoring.

The basic principle of the disclosure is described above with referenceto specific embodiments, but it should be noted that the merits,advantages, effects and the like mentioned in the disclosure are merelyexamples and are not intended to be limiting, and these merits,advantages, effects, and the like are not considered to be possessed byvarious embodiments of the disclosure. In addition, the specific detailsdisclosed above are for illustration purposes only and are not intendedto be limiting, and the details are not limited thereto.

The block diagrams of components, apparatus, devices and systemsinvolved in the disclosure are merely illustrative examples and are notintended to require or imply that connection, arrangement, andconfiguration must be performed in a manner shown in a block diagram. Aswill be recognized by those skilled in the art, the components,apparatus, devices and systems can be connected and arranged in anymanner. Words such as “including”, “comprising”, “having”, and the likeare open words, refer to “including but not limited to,” and may be usedinterchangeably. The terms “or” and “and” as used herein refer to theterms “and/or” and may be used interchangeably unless the contextclearly indicates otherwise. As used herein, the term “such as” refersto the phrase “such as, but not limited to,” and may be usedinterchangeably.

In addition, as used herein, “or” is used in the enumeration of itemsstarting with “at least one” to indicate a separate enumeration suchthat, for example, the enumeration of at least one of A, B, or C means Aor B or C, or AB or AC or BC, or ABC (ie, A, B, and C). Furthermore, theterm “exemplary” does not mean that the described example is preferredor better than other examples.

It should also be noted that, in the systems and methods of thedisclosure, various components or steps may be decomposed and/orrecombined. These decompositions and/or recombination should beconsidered as an equivalent scheme of the disclosure.

Various changes, substitutions and alterations of the techniquesdescribed herein may be made without departing from the teachings of theteachings defined by the appended claims. Furthermore, the scope of theclaims of the disclosure is not limited to the specific aspects of theprocessing, machines, manufacture, events, means, methods, and actsdescribed above. The presently existing or later-to-be-developedprocesses, machines, manufacture, events, compositions, means, methods,or acts may be developed that substantially the same function or achievesubstantially the same result as the respective aspects describedherein. Accordingly, the appended claims are intended to include withintheir scope such processes, machines, manufacture, compositions ofmatter, means, methods, or actions.

The above description of the disclosed aspects is provided to enable anyperson skilled in the art to make or use the disclosure. Variousmodifications to these aspects will be readily apparent to those skilledin the art, and the generic principles defined herein may be applied toother aspects without departing from the scope of the disclosure. Thus,the disclosure is not intended to be limited to the aspects shown hereinbut is to be accorded the widest scope consistent with the principlesand novel features disclosed herein.

The foregoing description has been presented for purposes ofillustration and description. Furthermore, this description is notintended to limit embodiments of the disclosure to the form disclosedherein. While various exemplary aspects and embodiments have beendiscussed above, those skilled in the art will recognize certainvariations, modifications, changes, additions, and sub-combinationsthereof.

1. A video feature extraction method, comprising: extracting one or moreframes from a video object to obtain one or more frames of images;obtaining one or more shift vectors for each of the one or more framesof images, using each of the one or more shift vectors, taking any pixelin each of the one or more frames of images as a starting point,determining a terminal point directed by the one or more shift vectorsfrom the starting point, and determining an image feature of the one ormore frames of images according to a difference between the startingpoint and the terminal point; and determining a video feature of thevideo object according to the image feature of the one or more frames ofimages.
 2. The video feature extraction method according to claim 1,wherein a length and a direction of the one or more shift vectors arearbitrary.
 3. The video feature extraction method according to claim 1,wherein the determining an image feature of the one or more frames ofimages according to a difference between the starting point and theterminal point comprises: determining the difference between eachstarting point and a corresponding terminal point in one of the one ormore frames of images for each of the one or more shift vectors, anddetermining a feature bit of the one of the one or more frames of imagesbased on an identical shift vector of the one or more shift vectorsaccording to an overall situation of each difference based on theidentical shift vector so as to obtain the feature bit of each of theone or more shift vectors; and determining the image feature of the oneof the one or more frames of images according to the feature bit of theone of the one or more frames of images based on the one or more shiftvectors.
 4. The video feature extraction method according to claim 3,wherein the obtaining one or more shift vectors further comprisesdistributing a counter to each of the one or more shift vectors; whereinthe determining a feature bit of the one of the one or more frames ofimages based on an identical shift vector of the one or more shiftvectors according to an overall situation of each difference based onthe identical shift vector comprises: increasing or reducing a value ofthe counter corresponding to the one or more shift vectors according tothe each difference of the one or more shift vectors; and comparing thevalue of the counter with a predetermined set value, and determining thefeature bit of the one of the one or more frames of images based on theone or more shift vectors according to the comparison result.
 5. Thevideo feature extraction method according to claim 1, wherein thedifference between the starting point and the terminal point comprises adifference between brightness information of the starting point andbrightness information of the terminal point.
 6. (canceled)
 7. A deviceof extracting video feature, comprising: at least one processor; and atleast one memory communicatively coupled to the at least one processorand storing instructions that upon execution by the at least oneprocessor cause the device to: extract one or more frames from a videoobject to obtain one or more frames of images; obtain one or more shiftvectors for each of the one or more frames of images, use each of theone or more shift vectors, take any pixel in each of the one or moreframes of images as a starting point, determine a terminal pointdirected by the one or more shift vectors from the starting point, anddetermine an image feature of the one or more frames of images accordingto a difference between the starting point and the terminal point; anddetermine a video feature of the video object according to the imagefeature of the one or more frames of images.
 8. The device according toclaim 7, wherein the at least one memory further stores instructionsthat upon execution by the at least one processor cause the device to:extract one or more frames from a video object to obtain one or moreframes of images; obtain one or more shift vectors for each of the oneor more frames of images, use each of the one or more shift vectors,take any pixel in each of the one or more frames of images as a startingpoint, determine a terminal point directed by the one or more shiftvectors from the starting point, and determine an image feature of theone or more frames of images according to a difference between thestarting point and the terminal point; determine a video feature of thevideo object according to the image feature of the one or more frames ofimages; wherein a length and a direction of the one or more shiftvectors are arbitrary. 9-10. (canceled)
 11. A computer readable storagemedium, configured for storing non-transitory computer readableinstructions, when the non-transitory computer readable instructions areexecuted by a computer, causing the computer to perform operations, theoperations comprising: extracting one or more frames from a video objectto obtain one or more frames of images; obtaining one or more shiftvectors for each of the one or more frames of images, using each of theone or more shift vectors, taking any pixel in each of the one or moreframes of images as a starting point, determining a terminal pointdirected by the one or more shift vectors from the starting point, anddetermining an image feature of the one or more frames of imagesaccording to a difference between the starting point and the terminalpoint; and determining a video feature of the video object according tothe image feature of the one or more frames of images.
 12. (canceled)13. The video feature extraction method according to claim 2, whereinthe difference between the starting point and the terminal pointcomprises a difference between brightness information of the startingpoint and brightness information of the terminal point.
 14. The videofeature extraction method according to claim 3, wherein the differencebetween the starting point and the terminal point comprises a differencebetween brightness information of the starting point and brightnessinformation of the terminal point.
 15. The video feature extractionmethod according to claim 4, wherein the difference between the startingpoint and the terminal point comprises a difference between brightnessinformation of the starting point and brightness information of theterminal point.
 16. The device according to claim 7, wherein the atleast one memory further stores instructions that upon execution by theat least one processor cause the device to: extract one or more framesfrom a video object to obtain one or more frames of images; obtain oneor more shift vectors for each of the one or more frames of images, useeach of the one or more shift vectors, take any pixel in each of the oneor more frames of images as a starting point, determine a terminal pointdirected by the one or more shift vectors from the starting point, anddetermine an image feature of the one or more frames of images accordingto a difference between the starting point and the terminal point; anddetermine a video feature of the video object according to the imagefeature of the one or more frames of images; wherein the instructionthat upon execution causes the device to determine an image feature ofthe one or more frames of images according to a difference between thestarting point and the terminal point further comprises instructionsthat upon execution by the at least one processor cause the device to:determine the difference between each starting point and a correspondingterminal point in one of the one or more frames of images for each ofthe one or more shift vectors, and determine a feature bit of the one ofthe one or more frames of images based on an identical shift vector ofthe one or more shift vectors according to an overall situation of eachdifference based on the identical shift vector so as to obtain thefeature bit of each of the one or more shift vectors; and determine theimage feature of the one of the one or more frames of images accordingto the feature bit of the one of the one or more frames of images basedon the one or more shift vectors.
 17. The device according to claim 7,wherein the at least one memory further stores instructions that uponexecution by the at least one processor cause the device to extract oneor more frames from a video object to obtain one or more frames ofimages; obtain one or more shift vectors for each of the one or moreframes of images, use each of the one or more shift vectors, take anypixel in each of the one or more frames of images as a starting point,determine a terminal point directed by the one or more shift vectorsfrom the starting point, and determine an image feature of the one ormore frames of images according to a difference between the startingpoint and the terminal point; and determine a video feature of the videoobject according to the image feature of the one or more frames ofimages; wherein the instruction that upon execution causes the device todetermine an image feature of the one or more frames of images accordingto a difference between the starting point and the terminal pointfurther comprises instructions that upon execution by the at least oneprocessor cause the device to: determine the difference between eachstarting point and a corresponding terminal point in one of the one ormore frames of images for each of the one or more shift vectors, anddetermine a feature bit of the one of the one or more frames of imagesbased on an identical shift vector of the one or more shift vectorsaccording to an overall situation of each difference based on theidentical shift vector so as to obtain the feature bit of each of theone or more shift vectors; and determine the image feature of the one ofthe one or more frames of images according to the feature bit of the oneof the one or more frames of images based on the one or more shiftvectors; wherein the instruction that upon execution causes the deviceto obtain one or more shift vectors further comprises instructions thatupon execution by the at least one processor cause the device todistribute a counter to each of the one or more shift vectors; whereinthe instruction that upon execution causes the device to determine afeature bit of the one of the one or more frames of images based on anidentical shift vector of the one or more shift vectors according to anoverall situation of each difference based on the identical shift vectorfurther comprises instructions that upon execution by the at least oneprocessor cause the device to: increase or reduce a value of the countercorresponding to the one or more shift vectors according to the eachdifference of the one or more shift vectors; and compare the value ofthe counter with a predetermined set value, and determine the featurebit of the one of the one or more frames of images based on the one ormore shift vectors according to the comparison result.
 18. The computerreadable storage medium according to claim 11, wherein a length and adirection of the one or more shift vectors are arbitrary.
 19. Thecomputer readable storage medium according to claim 11, wherein thedetermining an image feature of the one or more frames of imagesaccording to a difference between the starting point and the terminalpoint comprises: determining the difference between each starting pointand a corresponding terminal point in one of the one or more frames ofimages for each of the one or more shift vectors, and determining afeature bit of the one of the one or more frames of images based on anidentical shift vector of the one or more shift vectors according to anoverall situation of each difference based on the identical shift vectorso as to obtain the feature bit of each of the one or more shiftvectors; and determining the image feature of the one of the one or moreframes of images according to the feature bit of the one of the one ormore frames of images based on the one or more shift vectors.
 20. Thecomputer readable storage medium according to claim 19, wherein theobtaining one or more shift vectors further comprises distributing acounter to each of the one or more shift vectors; wherein thedetermining a feature bit of the one of the one or more frames of imagesbased on an identical shift vector of the one or more shift vectorsaccording to an overall situation of each difference based on theidentical shift vector comprises: increasing or reducing a value of thecounter corresponding to the one or more shift vectors according to theeach difference of the one or more shift vectors; and comparing thevalue of the counter with a predetermined set value, and determining thefeature bit of the one of the one or more frames of images based on theone or more shift vectors according to the comparison result.
 21. Thecomputer readable storage medium according to claim 11, wherein thedifference between the starting point and the terminal point comprises adifference between brightness information of the starting point andbrightness information of the terminal point.
 22. The computer readablestorage medium according to claim 18, wherein the difference between thestarting point and the terminal point comprises a difference betweenbrightness information of the starting point and brightness informationof the terminal point.
 23. The computer readable storage mediumaccording to claim 19, wherein the difference between the starting pointand the terminal point comprises a difference between brightnessinformation of the starting point and brightness information of theterminal point.
 24. The computer readable storage medium according toclaim 20, wherein the difference between the starting point and theterminal point comprises a difference between brightness information ofthe starting point and brightness information of the terminal point.